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I. REAL PARTIES IN INTEREST 
The real party in interest is Monsanto Company, the parent of wholly-owned subsidiary 
DeKalb Genetics Corporation, the assignee of this application. 

n. RELATED APPEALS AND INTERFERENCES 
There are no interferences or appeals for related cases. 

HI. STATUS OF THE CLAIMS 
Claims 1-100 were filed with the original application on June 23, 2000. Claims 1-71, 74- 
77, 80-83, 85, 97 and 92-93 were canceled and claims 78, 84, 86, 88-91, 94, 95, and 100 were 
amended in a Preliminary Amendment filed concurrently with the application. Claims 100-1 14 
were added in a Supplemental Preliminary Amendment, mailed in the case on November 1, 
2001. 

Claims 72, 73, 78, 79, 84, 86, 88-91 and 94-110 were indicated in the first Office Action 
as pending in the case. It is thus believed by Appellants that the Examiner had renumbered 
claims 100-114 to 96-110, due to failure to use claim numbers 96-99 in the application as filed 
and in subsequent Preliminary Amendments. Added claims 100-1 14 have thus been renumbered 
to 96-110. 

Claims 72, 73, 78, 79, 97 and 98 were amended in Appellants response to the first Office 
Action, mailed June 13, 2002. Claims 72-73, 78-79, 84, 86, 88-91, 94-110 were pending at the 
time of the final Office Action mailed in the case on September 27, 2002 and were finally 
rejected. No amendments have been filed after the final Office Action. The final rejection of 
claims 72-73, 78-79, 84, 86, 88-91, 94-110 is the subject of the instant appeal. A copy of the 
appealed claims is attached hereto as APPENDIX 1 . 
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IV. STATUS OF AMENDMENTS 
No amendments were made subsequent to the final Office Action. 

V. SUMMARY OF THE INVENTION 
The invention relates to transgenic maize plants having increased starch content and 
extractability and methods of use thereof. Specification at page 6, line 13 to page 7, line 2. 
Increased starch content and extractability is obtained by the expression of RNA molecules that 
are substantially identical or complementary to l9kD or 22kD a-zein plant seed storage proteins, 
thereby decreasing the amount of corresponding seed storage protein and increase in starch 
content and extractability in the cells of the plant. Specification at page 6, line 13 to page 7, line 
23. 

VI. ISSUES ON APPEAL 

(1) Are claims 72-73, 78-79, 84, 86, 88-91 and 94-110 properly rejected under 35 
U.S.C. §112, first paragraph, as not being enabled by the specification? 

(2) Are claims 88-89, 90, 91 and 102-107 properly rejected under 35 U.S.C. §112, 
second paragraph, as being indefinite for failing to particularly point out the subject matter which 
Applicant regards as the invention? 

VII. GROUPING OF THE CLAIMS 
The claims stand or fall together. 
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VIII. SUMMARY OF THE ARGUMENT 

A. The Examiner has failed to establish a prima facie case of lack of enablement 
under 35 U.S.C. § 1 12, first paragraph, and improperly ignores evidence submitted by Appellant 
affirmatively demonstrating the enablement of the claims. 

B. The terms "substantially identical" and "substantially complementary" are not 
indefinite under 35 U.S.C. § 112, second paragraph, as they are used in the claims as one of skill 
in the art would readily understand the meaning of the terms when viewed in light of the 
language of the claim and teaching of the specification. 

Vffl. ARGUMENT 
A. Rejection Under 35 U.S.C. §112. First Paragraph 

Claims 72-73, 78-79, 84, 86, 88-91 and 94-110 were finally rejected under the first 
paragraph of 35 U.S.C. §1 12 as allegedly not being enabled by the specification. In particular, 
the Examiner alleged that the specification does not teach one of skill in the art how to increase 
the starch content or starch extractability or kernel hardness of seeds. It is stated that the 
specification only teaches how to make maize seeds with decreased amounts of the amino acid 
leucine and increased lysine by transforming Zea mays plants with SEQ ID NOs:l and 2 
operably linked to a Z10 promoter. 

1. The Examiner Has Failed to Establish a Prima Facie Case of Lack of Enablement 
The claims are fully enabled by the specification. No evidence has been put forth by the 
Examiner sufficient to doubt enablement. In this regard, Appellants note that the Examiner has 
apparently placed the burden on Applicant to affirmatively establish enablement. For example, 
on page 3 of the final Office Action, the Examiner states that "Applicant does not address the 
issue of enablement, rather Applicant only addresses the validity or applicability of references 



25258162.1 



5 



cited by the Examiner in support of undue experimentation to make and/or use the claimed 
invention." However, what Appellants had done in the previous response to the first Office 
Action is demonstrate that the Examiner had failed to establish any basis to doubt enablement, 
while affirmatively demonstrating why the very references cited by the Examiner supported 
enablement. The examiner has, therefore, failed to meet the burden of establishing a prima facie 
case of lack of enablement. An unsupported allegation of lack of enablement does not meet the 
PTO's burden under 35 U.S.C. § 1 12, first paragraph, "[otherwise, there would be no need for 
the applicant to go to the trouble and expense of supporting his presumptively accurate 
disclosure." In re Marzocchi, 169 U.S.P.Q. 367, 369-70 (CCPA 1971). 

2. The References Cited by the Examiner Support the Enablement of Appellants 
Claims 

In Appellants response to the first Office Action, the lack of any support for the 
enablement rejection was demonstrated. The Examiner attempted to support the rejection by 
citing two references: Coleman et al. (1997) (EXHIBIT A) and Marks et al. (1985) (EXHIBIT 
B). Coleman et al. was cited as showing that high-lysine mutants exhibiting a reduction of a- 
zein content were "concomitant with an inferior endosperm quality." It was thus suggested that, 
because of the supposedly inferior endosperm quality, "reducing the a-zein protein content of 
maize seeds using the strategy of Applicants, will not increase the starch content or starch 
extractability of maize seeds." However, the reference does the very opposite and demonstrates 
the enablement of the claims. This "inferior" endosperm is in fact a soft and starchy endosperm. 
(See EXHIBIT A, p. 7094, paragraph 2). The reference therefore shows the direct correlation 
between increased lysine, decreased a-zein and soft and starchy endosperm. The first Office 
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Action acknowledges this on page 5. 1 Demonstration of this correlation supports enablement, 
given that the final Office Action has already acknowledged that Appellants have demonstrated 
enablement for increasing lysine. The fact that Coleman et al. also demonstrated successful 
expression of a 24 kDa a-zein gene to induce the mutant phenotype provides still further 
evidence of enablement of the instant claims. 

It is further noted that whether the endosperm is subjectively "inferior" or not is 
irrelevant to enablement. What is considered inferior for a plant used for one purpose, for 
example, for human consumption, is not necessarily the same as for a plant used for production 
of corn starch. Further, an "inferior" endosperm does not equate to an inability to increase starch 
content or extractability, as described in Coleman et al. 

The other reference cited in the rejection, Marks et al. (1985) (EXHIBIT B), was cited as 
allegedly indicating that there are many different forms of 19 kD and 22 kD a-zeins with 
divergent and unpredictable functions. However, this reference also supports the enablement of 
the invention. Marks et al. states in the abstract that: 

A comparison of the DNA and protein sequences of a group of zein cDNA clones 
reveals that they share extensive sequence homology and probably originated 
from a common ancestral gene. A comparison of clones corresponding to Mr 
22,000 polypeptides shows that they are 92% homologous, while five clones 
corresponding to Mr 19,000 zein vary in homology from 75 to 95%. (emphasis 
added) 

Marks et al, therefore, demonstrates the common structural characteristics shared among 19 kD 
and 22 kD a-zeins. In the first paragraph of the Discussion section of Marks et al, it is indicated 
that cDNA sequences among the 19 kD and 22 kD group of a-zein sequences are 75 to 95% and 
92% homologous, respectively. Further, Marks et al provides sequence information and 



1 See middle paragraph: "Two 'high -lysine' mutants were identified, opaque2 (o2) and floury2 (fl2) 
which have a higher lysine content due to a reduction in the a-zein protein content of the endosperm." 
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compansons among 19 kD and 22 kD a-zeins. The disclosure of Marks et al. thereby 
demonstrates the shared structural characteristics of the 19 kD and 22 kD cx-zein seed storage 
proteins. 

No basis has been provided by the Examiner to indicate that different isoforms have 
"divergent functions." The high degree of homology and indicated common ancestor among 
zeins strongly contradicts this assertion. Additionally, there is no support in the Marks et al. 
reference for the contention that any of the mRNA isoforms described encode proteins other than 
zeins, the function of which is that of a seed storage protein. The reference, therefore, does not 
indicate divergent functions, but rather indicates the shared original and conserved structure of 
the 19 kD and 22 kD ct-zein plant seed storage proteins. 

3. The Examiner Improperly Ignored Data Submitted by Appellants Affirmatively 

Demonstrating Enablement of the Claims 
In doubting Appellants evidence of enablement, the Examiner noted that, of four 
independent lines representing 8 samples in Table V on page 78 of the specification, only three 
samples exhibited a statistical increase in lysine {e.g., show expression of the zein seed storage 
protein) compared to a control. However, this is irrelevant to enablement. What is relevant is 
that the Application teaches one of skill in the art to make and use the claimed invention without 
undue experimentation. An indicated success rate of 3/8 cannot reasonably indicate undue 
experimentation, as it would be well within the capabilities of one of skill in the art to readily 
produce many lines expressing 19 kD and 22 kD a-zein plant seed storage proteins without 
undue experimentation using the methodology described in the specification. No basis has been 
provided to suggest why one of skill in the art would reasonably conclude otherwise. The fact 
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that three of eight samples exhibited a statistical increase in lysine demonstrates the repeatability 

and enablement of the technique. 

Further evidence of enablement of the claims is provided at page 83 of the specification. 
There it is described that endosperm cells in a maize kernel are made up primarily of large starch 
granules and protein sequestered in protein bodies. It is shown that a reduction in the number of 
protein bodies in endosperm cells denved from a transformant produced using a zein antisense 
construct was achieved. This was shown by light microscopy and the results given in FIG. 9. As 
can be seen in the figure, the results demonstrated a decrease in the amount of seed storage 
protein and therefore an increase in the relative starch content of the kernel. 

In conclusion, the Examiner has failed to set forth any basis for doubting the enablement 
of the claims. Appellants have further affirmatively set forth evidence of the enablement of the 
claims that has not been contradicted by the Examiner. The rejection must therefore be reversed. 

B . Re jection ofClaims Under 35 U.S.C Ml?.. Second Paragraph 

Claims 88-89, 90, 91 and 102-107 were finally rejected under 35 U.S.C. §112, second 
paragraph, as allegedly being indefinite for failing to particularly point out the subject matter 
which Applicant regards as the invention. In particular, the Action rejects the claims as being 
indefinite for the recitation of "substantially complementary to all or a portion" and 
"substantially identical to all or a portion." In particular, it is stated that the recited phrases are 
vague and unclear and do not specify what portion or percent of the sequence applicants are 
referring to. 
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In response, it is noted that the terms "substantially identical" and "substantially 
complementary" are defined in the specification at page 12, lines 1 1-24. The use of the terms in 
the claims is thus not indefinite. Still further, the claims define the terms by indicating that 
expression of the seed storage protein RNA decreases the amount of seed storage protein. Thus 
the meaning of the terms to one of skill in the art is clear, e.g., that substantially identical to or 
complementary to all or a portion refers to those sequences that are complementary such as to 
hybridize with the seed storage protein mRNA in vivo to cause antisense suppression. Such a 
sequence could not represent a single homologous base pair, as it is well known to those of skill 
in the art that in order to form the type of stable complex required for antisense suppression, 
longer stretches of complementary sequences are required. 

The cited terms are fully definite in compliance with the statute. The test for defmiteness 
under 35 U.S.C. § 1 12, second paragraph is whether "those skilled in the art would understand 
what is claimed when the claim is read in light of the specification." Orthokinetics, Inc. v. Safety 
Travel Chairs, Inc., 806 F.2d 1565, 1576, 1 USPQ2d 1081, 1088 (Fed. Cir. 1986). If one skilled 
in the art is able to ascertain the meaning of the claim, 35 U.S.C. § 112, second paragraph, is 
satisfied. Id. Given the definition in the specification and claim language, the referenced terms 

fully meet this requirement. 

In conclusion, Appellants note that the rejected claims are fully definite in compliance 
with 35 U.S.C. § 1 12, second paragraph. The rejection must therefore be reversed. 
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IX. CONCLUSION 
It is respectfully submitted, in light of the above, that none of the appealed claims lack 
enablement or are indefinite. Therefore, Appellants respectfully request that the Board reverse 
the pending grounds for rejection. 

Respectfully submitted, 



FULBRIGHT & JAWORSKI L.L.P 
600 Congress Avenue, Suite 2400 
Austin, Texas 78701 
(512) 536-3085 




cobert E. Hanson 
'Reg. No. 42,628 
Attorney for Appellants 



Date: February 24, 2003 
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APPENDIX 1 : APPEALED CLAIMS 



72. A fertile transgenic lea mays plant having an increased starch content, the genome of 
which is stably augmented by a preselected DNA sequence encoding an RNA molecule 
which is substantially identical, or complementary, to an mRNA encoding a 19kD or a 
22kD a-zein plant seed storage protein, wherein the preselected DNA sequence is 
expressed in the cells of the transgenic plant in an amount sufficient to decrease the 
amount of said seed storage protein and increase starch content in the cells of a plant 
which only differ from the cells of said transgenic plant in that said preselected DNA 
sequence is absent, and wherein said preselected DNA sequence is transmitted through a 
complete normal sexual cycle of the transgenic plant to the next generation. 

73. A fertile transgenic lea mays plant, the seeds of which have an increased starch 
extractability, the genome of said plant which is stably augmented by a preselected DNA 
sequence encoding an RNA molecule which is substantially identical, or complementary, 
to an mRNA encoding a 19kD or a 22kD a-zein plant seed storage protein, wherein the 
preselected DNA sequence is expressed in the seeds of the transgenic plant in an amount 
sufficient to decrease the amount of said seed storage protein and increase the starch 
extractability of the seed relative to the amount of said seed storage protein and starch 
extractability in the seeds of a plant which only differ from the seeds of said transgenic 
plant in that said preselected DNA sequence is absent, and wherein said preselected DNA 
sequence is transmitted through a complete normal sexual cycle of the transgenic plant to 
the next generation. 

78. A seed derived from the plant of claim 72 or 73, wherein the seed comprises said 
preselected DNA sequence. 

79. A progeny plant derived from the seed of claim 78, wherein the plant comprises said 
preselected DNA sequence. 
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84. The transgenic plant of claim 72 or 73, wherein the promoter comprises the 10 kD zein 

promoter. 

86. The transgenic plant of claim 72 or 73, wherein the promoter comprises the 27kD zein 
promoter. 

88. The transgenic plant of claim 72 or 73, wherein the preselected DNA sequence, which 
encodes an RNA molecule substantially complementary to all or a portion of an mRNA 
encoding a seed storage protein, encodes an RNA molecule substantially complementary 
to all or a portion of an mRNA encoding 19 kD a-zein protein. 

89. The transgenic plant of claim 72 or 73, wherein the preselected DNA sequence, which 
encodes an RNA molecule substantially complementary to all or a portion of an mRNA 
encoding a seed storage protein, encodes an RNA molecule substantially complementary 
to all or a portion of an mRNA encoding a 22 kD a-zein protein. 

90. The transgenic plant of claim 72 or 73, wherein the preselected DNA sequence, which 
encodes an RNA molecule, substantially identical to all or a portion of an mRNA 
encoding a seed storage protein, encodes an RNA molecule substantially identical to all 
or a portion of an mRNA encoding a 19 kD a-zein protein. 

91 . The transgenic plant of claim 72 or 73, wherein the preselected DNA sequence, which 
encodes an RNA molecule substantially identical to all or a portion of an mRNA 
encoding a seed storage protein, encodes an RNA molecule substantially identical to all 
or a portion of an mRNA encoding a 22 kD a-zein protein. 

95. The transgenic plant of claim 72 or 73, wherein the cell is transformed by a method 
selected from the group consisting of electroporation, microinjection, microprojectile 
bombardment, and liposomal encapsulation. 
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96. The transgenic plant of claim 78 or 79, further comprising stably transforming the cells 
with at least one selectable marker gene. 

97 A method of producing a Zea mays seed with an increased starch content, compnsing: 

(a) growing a transgenic Zea mays plant, the genome of which is augmented 
with a preselected DNA sequence encoding an RNA molecule which is 
substantially identical, or complementary to an mRNA encoding a 19kD 
or a 22kD a-zein seed storage protein, wherein the preselected DNA 
sequence is expressed in the cells of the Zea mays plant in an amount 
sufficient to decrease the amount of seed storage protein; and 

(b) selecting a seed from the transgenic Zea mays plant, wherein the seed has 
an increased amount of starch relative to the amount of starch in a seed 
selected from a plant which does not comprise said preselected DNA 
sequence. 

98. A method of obtaining starch from a Zea mays seed, comprising: 

(a) growing a transgenic Zea mays plant, the genome of which is augmented 
with a preselected DNA sequence encoding an RNA molecule which is 
substantially identical, or complementary, to an mRNA encoding a 19kD 
or a 22kD a-zein seed storage protein, wherein the preselected DNA 
sequence is expressed in the cells of the Zea mays plant in an amount 
sufficient to decrease the amount of seed storage protein; 

(b) obtaining seed from said plaint; and 

(c) extracting starch from the seed. 



99. 



100. 



101. 



The method of claim 97 or 98 wherein the preselected DNA sequence is operably linked 
to a promoter functional in plant cells. 

The method of claim 99 wherein the promoter comprises the 10 kD zein promoter. 
The method of claim 99 wherein the promoter comprises the 27 kD zein promoter. 
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102. The method of claim 97 or 98 wherein the preselected DNA sequence encodes an RNA 
molecule which is substantially identical to all or a portion of the mRNA encoding a seed 
storage protein. 

103. The method of claim 97 or 98 wherein the preselected DNA sequence encodes an RNA 
molecule which is substantially complementary to all or a portion of the mRNA encoding 
a seed storage protein. 

104. The method of claim 102 wherein the preselected DNA sequence encodes an RNA 
molecule substantially identical to all or a portion of mRNA encoding a 19 kD a-zein 
protein. 

105. The method of claim 102 wherein the preselected DNA sequence encodes an RNA 
molecule substantially identical to all or a portion of an mRNA encoding a 22 kD a-zein 
protein. 

106. The method of claim 103 wherein the preselected DNA sequence encodes an RNA 
molecule substantially complementary to all or a portion of an mRNA encoding a 19 kD 
a-zein protein. 

107. The method of claim 103 wherein the preselected DNA sequence encodes an RNA 
molecule substantially complementary to all or a portion of an mRNA encoding a 22 kD 
a-zein protein. 

108. The method of claim 97 or 98 wherein the genome of the transgenic Zea mays plant is 
further augmented with a DNA sequence encoding a polypeptide that provides the 
transgenic Zea mays plant with increased kernel hardness. 

109. The method of claim 97 or 98 wherein the transgenic Zea mays plant is produced from 
cells transformed by a method selected from the group consisting of electroporation, 
microinjection, microprojectile bombardment, and liposomal encapsulation. 



25258162.1 



15 



110. The method of claim 97 or 98 wherein the genome of the transgenic Zea mays plant is 
further augmented with at least one selectable marker gene. 
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APPENDIX 2: EXHIBITS 
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Vol 94, pp. 7094-7097, June 1997 

Plant Biology 

Expression of a mutant a-zein creates the floury 2 phenotype in 
transgenic maize 

(endosperm/prolamin) 

Craig E. Coleman *t, Amy M. Clore*, Jerry P. Ranch*, Regina Higgins*, Mauricio A. Lopes§, 
and Brian A. Larkins** 

•Department of Plant Sciences, University of Arizona, Tucson, AZ 85721; ^Pioneer Hi-Bred International, Inc., 7300 NW 62nd Avenue, Johnston, IA 50131-1004; 
and *Centro Nacional de Pesquisa de Milhoe Sorgo-Empresa Brasicira de Pesquisa Agropecuaria, Caixa Postal 151, 35700 Sete Lagoas, MG, Brazil 



Contributed by Brian A. Larkins, May 6, 1997 

ABSTRACT The maize floury 2 mutation results in the 
formation of a soft, starchy endosperm with a reduced amount 
of prolamin (zein) proteins and twice the lysine content of the 
wild type. The mutation is semidominant and is associated 
with small, irregularly shaped protein bodies, elevated levels 
of a 70-kDa chaperone in the endoplasmic reticulum, and a 
novel 24-kDa polypeptide in the zein fraction. The 24-kDa 
polypeptide is a precursor of a 22-kDa a-zein protein that is 
not properly processed. The defect is due to an alanine-to- 
valine substitution at the C-terminal position of the signal 
peptide, which causes the protein to be anchored to the 
endoplasmic reticulum. We postulated that the phenotype 
associated with the floury2 mutation is caused by the accu- 
mulation of the 24-kDa a-zein protein. To test this hypothesis, 
we created transgenic maize plants that produce the mutant 
protein. We found that endosperm in seeds of these plants 
manifests the floury2 phenotype, thereby confirming that the 
mutant a-zein is the molecular basis of this mutation. 



Zeins are prolamin storage proteins that accumulate in the 
endosperm of maize (Zea mays L.) seeds. They are composed 
of four different types of polypeptides, classified as a-, 0-, y-, 
and 5-zeins (1). Accretions of zein proteins form spherical 
protein bodies within the lumen of the endoplasmic reticulum 
(ER), and there is a distinct spatial arrangement of these 
proteins within a protein body; ]3- and 7- zeins are located on 
the periphery, whereas a- and 6-zeins are found in the interior 
(2, 3). Collectively, the zein proteins are rich in glutamine and 
proline, but they lack lysine and tryptophan. Because zeins 
constitute such a large proportion of the total seed protein 
(60-70%), the amino acid composition of these proteins causes 
the grain to be of inferior nutritional quality for monogastric 
animals. 

Efforts to improve the protein quality of maize seed have 
focused on mutants in which zein synthesis is reduced and the 
lysine content is increased. The first "high-lysine" mutants to 
be identified were opaque2 (o2) and floury2 (J12) (4, 5). 
Unfortunately, the favorable nutritional quality of these mu- 
tants is offset by the inferior physical properties of their soft, 
starchy endosperms. It appears that the starchy endosperm of 
the o2 and fl2 mutants is caused by changes in the nature of 
their protein bodies. The o2 mutation affects a transcriptional 
activator of a subset of a-zein genes, leading to a reduction in 
a-zein protein synthesis and the formation of protein bodies 
that are significantly smaller than normal (6-9), The fl2 
mutation, which is semidominant, causes a decrease in syn- 
thesis of all classes of zeins, and the resultant protein bodies are 
not only smaller than normal, but they are also asymmetrical 
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and misshapen (10, 11). Another feature of fl2 endosperm is 
the overexpression of the ER-resident binding protein (BiP), 
which becomes deposited at the periphery of the mutant 
protein bodies (12-15). 

We have postulated that the phenotype associated with fl2 
is caused by the accumulation of a novel 24-kDa a-zein protein 
(16, 17). This hypothesis is partially based on tight linkage 
between the gene encoding the 24-kDa protein and the fl2 
locus, but it is also consistent with the abnormal structure of 
the protein. The mutant protein is 2 kDa larger than expected, 
because of a defect in its processing following targeting to the 
ER lumen. An alanine-to-valine substitution at the C-terminal 
position of the signal peptide prevents its removal, thereby 
anchoring the protein to the lumenal face of the ER membrane 
(18). To investigate whether this protein is responsible for the 
mutant phenotype, we transformed normal maize plants with 
the gene encoding the mutant 24-kDa a-zein protein. We show 
here that seeds of these plants manifest the key phenotypic 
characteristics associated with the fl2 mutation. 

MATERIALS AND METHODS 

Transformation of Maize Plants. Maize embryos were 
cotransformed with the 24-kDa a-zein gene in plasmid 
pCC515 (17) and the bacterial bialaphos (BAR) gene (19) as 
a selectable marker. The 24-kDa a-zein gene in pCC515 is 
flanked by 3.0 kb of 5' and 3.7 kb of 3' noncoding sequences. 
The selectable marker gene plasmid is of the form 
ubi::ubiintron::BAR:-j>inH, where ubi is a maize ubiquitin pro- 
moter and first intron and pinll is the potato protease inhibitor 
II 3' noncoding sequence, Plasmid DNAs were delivered by 
microprojectile bombardment to embryogenically responsive, 
immature embryos from the maize variety High Type II (20). 
Embryos were recovered from herbicide- resistant calli and 
grown to maturity. Transgenic plants were outcrossed as the 
female parent to an inbred line, and the progeny were scored 
for the floury kernel phenotype. 

PCR Amplification of the 24-kDa a-Zein Gene. Genomic 
DNA was extracted from seedlings germinated from normal 
and floury kernels (21). The DNA was amplified by PCR using 
primers corresponding to the signal peptide coding sequences 
of the 24-kDa a-zein gene (5 -GCCCTTTTAGTGAGCG- 
CAACAAATGTG-3') and coding sequences for the seventh 
a-helical repeat of the protein (5 ' -GCAGGGTTTGCC AT- 
AGCTAGCTGATG-3'). Products were separated by electro- 
phoresis in 1% agarose gels, stained with ethidium bromide, 
and photographed with a Polaroid DS-34 camera. 

Protein Extraction from Maize Flour and Iramunoblotting. 
A portion of each endosperm was cut from the seed prior to 
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germination and converted into a fine flour using a ball mill. 
Proteins were extracted from the meal and separated accord- 
ing to their solubility in 70% alcohol (22). Alcohol-soluble 
(zein) proteins were separated by SDS/PAGE, blotted onto 
nitrocellulose, and immunoreacted with rabbit anti-a-zein 
polyclonal antibody (23). Alcohol-insoluble proteins were 
separated by SDS/PAGE, blotted, and immunoreacted with a 
rabbit anti-BIP polyclonal antibody (12). Goat anti-rabbit 
alkaline phosphatase conjugate was used for indirect detection 
of a-zein and BiP on the immunoblots (24). 

Fixation and Embedding of Endosperms and Electron 
Microscopy. Seeds were harvested 18 days after pollination 
from a self-fertilized maize plant that was hemizygous for the 
24-kDa a-zein transgene. Endosperms were fixed, embedded, 
sectioned, and viewed with a transmission electron microscope 
as described elsewhere (11). 

RESULTS 

Detection of the Transgene in Transformed Maize Seed- 
lings. Transgenic maize plants were generated by a biolistic 
method using microprojectiles coated with plasmid pCC515 
(17), which contains the 24-kDa a-zein gene within a genomic 
DNA fragment, and a plasmid containing the BAR gene (20), 
which confers resistance to the herbicide Bast a. Plants from 25 
herbicide-resistant events that were recovered from the trans- 
formation were crossed as females to an untransformed inbred 
line. Fi progeny from 17 of these crosses segregated approx- 
imately 1 to 1 for floury-appearing kernels, consistent with the 
presence of a single site of transgene integration. To determine 
whether this phenotype was associated with the insertion of the 
24-kDa a-zein gene, we used PCR primers for the coding 



region of the 24-kDa a-zein gene to amplify genomic DNA 
from Fj seedlings. A 560-bp fragment was produced from 
DNA of the floury seedlings (Fig. L4, lanes 4, 6, and 8), but 
not the wild-type seedlings (Fig. \A, lanes 3, 5, and 7). A 
fragment of similar size was amplified from W64A/72 DNA 
(Fig. M, lane 9), but not from DNA of untransformed 
seedlings (Fig. \A, lanes 1 and 2). 

Immunodetection of a-Zein Proteins from Seeds of Trans* 
genie Plants. An immunoblot of a-zein proteins from mature 
seeds was prepared to determine whether insertion of the 
24-kDa a-zein gene resulted in synthesis of this protein (Fig. 
IB). The blot shows that the appearance of a 24-kDa protein 
band, indicated by arrowheads in lanes 4, 6, and 8 of Fig. IB, 
was always associated with the floury phenotype and insertion 
of the transgene. The 24-kDa protein band in these samples is 
similar to a band of identical molecular mass that was found in 
W64A/72 zein (indicated by an arrowhead in lane 9 of Fig. \B). 
The 24-kDa a-zein was not present in samples from normal 
progeny or seeds of untransformed plants (Fig. IB, lanes 1, 2, 
3, 5, and 7). Three bands with molecular masses greater than 
24 kDa were detected in all of the samples. In W64A/72, these 
polypeptides are glycosylated forms of 19-kDa a-zein proteins 
that are products of a gene(s) closely linked to the fl2 locus 
(J. W. Gillikin and R. S. Boston, personal communication). 
Apparently, the same glycoproteins are present in the maize 
line used for the transformation, although they had no effect 
on the kernel phenotype. 

Analysis of BiP Expression in Seeds of Transgenic Plants. 
One phenotypic characteristic associated with the/72 mutation 
is overexpression of the 70-kDa ER-resident chaperone, BiP. 
Using anti-BiP antibody, an immunoblot of the alcohol- 
insoluble proteins showed that the amount of BiP in mature 




Fig. 1. Transformation of the 24-kDa a-zein gene into transgenic maize plants leads to synthesis of the encoded protein and overexpression 
of BiP. Analysis of 3 of 17 independent transgenic lines is shown. One normal and one floury kernel from transgenic lines 236300 (lanes 3 and 
4), 236133 (lanes 5 and 6), and 236294 (lanes 7 and 8) were analyzed and compared with kernels from the untransformed (lane 1, PI), the 
untransformed parent outcrossed to a normal inbred (lane 2, PI x P2), and W64A/72 (lane 9) plants. (A) Using PCR primers specific to the 24-kDa 
a-zein gene, a 560-bp DNA fragment was amplified from genomic DNA of seedlings germinated from floury kernels, but not wild-type kernels. 

(B) An immunoblot of a-zeins shows a 24-kDa protein band (indicated by arrowheads) in samples from floury kernels, but not wild-type kernels. 

(C) An immunoblot of the 70-kDa maize homolog of BiP. 
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transgenic kernels was comparable to that of W64A/72 (Fig. 
1C, lanes 4, 6, 8, and 9). Normal quantities of BiP were 
detected in wild-type siblings and untransformed kernels (Fig. 
1C, lanes 1, 2, 3, 5 and 7). 

Protein Body Structure in the Endosperms of Seeds of 
Transgenic Plants. Another phenotypic characteristic associ- 
ated with the fl2 mutation is the formation of misshapen 
protein bodies. Protein bodies in the developing endosperm of 
seeds not expressing the 24-kDa a-zein gene are circular in 
cross-section and are relatively discreet; similar to the wild type 
(Fig. 24; ref. 2), whereas those from seeds expressing the gene 
have a convoluted shape and aggregate into large masses of 
protein, similar to W64A/72 (Fig. 2B; ref. 11). Analysis at 



higher magnification revealed that protein bodies in the de- 
veloping endosperm of seeds not expressing the 24-kDa a-zein 
gene contain darkly staining zein proteins primarily on the 
periphery of the protein bodies, as is seen in wild type (Fig. 2C, 
arrowheads; ref. 2). In contrast, those from endosperms ex- 
pressing the gene have many regions of darkly staining zein in 
their interior (Fig. 2D, arrowheads; ref. 11), indicating an 
abnormal organization of a-, /3-, and y-zeins within the protein 
body. 

DISCUSSION 

Previous studies provided strong genetic evidence implicating 
a mutant 24-kDa a-zein protein as the cause of the phenotype 




Fig. 2. Comparison of protein bodies from wild-type and transgenic maize endosperm. Protein bodies in the endosperm of wild-type seeds are 
round (A and C) compared with the misshapen protein bodies in the endosperm of seeds expressing the 24-kDa a-zein gene (B and D). Comparison 
of these protein bodies at tower magnification revealed that they appear as discreet spheres (circles in cross-section) in cells not expressing the 24-kDa 
a-zein gene (A), whereas they have a convoluted shape and are aggregated in cells expressing this gene (/?). Comparison of these protein bodies 
at higher magnification highlights their differences in the internal structure. In cells not expressing the gene, the darkly staining zein proteins are 
found primarily on the periphery of the protein bodies (C, arrowheads), whereas in cells expressing the gene, the protein bodies have many locules 
of darkJy staining zein in their interior (D, arrowheads). The irregularities seen in B and D are similar to those found in protein bodies of W64A/72. 
PB, protein body; RER, rough endoplasmic reticulum; S, starch grain. Bars = 0.5 ^im. 
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associated with the maize fl2 mutation (16, 17). The experi- 
ments described here show that expression of the mutant gene 
in transgenic maize plants leads to the accumulation of the 
24-kDa a-zein protein, which is correlated to the overexpres- 
sion of BiP and the appearance of malformed protein bodies. 
Because these characteristics of the seed from the transgenic 
plants resemble the phenotype of the fl2 mutant, our obser- 
vation conclusively demonstrates that fl2 is a structural muta- 
tion in a 22-kDa a-zein gene. 

The mechanism whereby the 24-kDa a-zein protein disrupts 
the normal development of the protein body, leading to an 
altered texture of the mature endosperm, is not known. One 
possible explanation is that the mutant hydrophobic a-zein 
protein is anchored to the ER membrane and remains on the 
outer surface of the protein body, thereby disrupting protein 
interactions that are important to the maintenance of the 
spherical shape. Normally, a-zein proteins become seques- 
tered in the interior of the protein body (2). This hypothesis is 
supported by evidence showing that the 24-kDa a-zein protein 
is associated with the membrane fraction following translation 
and translocation in the presence of microsomes (18). 

Our results provide evidence of the utility of maize trans- 
formation for the analysis of genetic mechanisms. The amount 
of the 24-kDa a-zein protein detected in the transgenic floury 
kernels and the W64A/72 kernel was similar, suggesting that the 
expression of the transgene may be comparable to that of the 
native gene. This result is encouraging, because it demon- 
strates that transgene expression in tissues of transformed 
maize plants can be adequately controlled by native promoters. 
Furthermore, the 3 kb of 5' and 3.7 kb of 3' noncoding 
sequence included in the a-zein genomic clone must be 
sufficient for directing appropriate temporal and spatial ex- 
pression in the endosperm; however, it is not known whether 
the gene is transcribed in other tissues of the transgenic plants. 

Traditionally, techniques for stable transfer of DNA to 
monocotyledonous cereals have lagged behind similar work 
with dicotyledonous plants (25). Transformation of dicot 
species usually involves gene delivery through infection with 
Agrobacterium tumefaciens, but monocots are not readily ame- 
nable to infection by this bacterium. Although Ishida et al (26) 
reported stable maize transformants following infection by A. 
tumefaciens, most successful transformations of maize plants 
have used microprojectile bombardment to deliver DNA to 
embryogenically responsive cells from immature embryos. 
Since the first report of the generation of stable transgenic 
maize plants using the biolistic method (27), this technique has 
been used to introduce a number of agronomically important 
traits into corn, such as insect resistance (20, 28), viral resis- 
tance (29), and fructan production (30). These advances, along 
with our report, attest to a promising future for the use of 
transgenic technologies in the genetic study and agronomic 
improvement of maize. 

We thank Hank Bass for helpful discussions at the inception of this 
work, Susan Martino-Catt for technical assistance, and Becky Boston 
for providing the BiP antibody. This research was supported by a grant 
from the Department of Energy to B.A.L, (DEFG0292ER 20079) and 
a postdoctoral fellowship from the National Institutes of Health to 
C.E.C. 
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A comparison of the DN A and protein sequences of a 
group of zein cDNA clones reveals that they share 
extensive sequence homology and probably originated 
from a common ancestral gene. A comparison of clones 
corresponding to M r 22,000 polypeptides shows they 
are 92% homologous, while five clones corresponding 
to the M r 19,000 zeins vary in homology from 75 to 
95%. The clones corresponding to the M r 22,000 pro- 
teins are 60—65% homologous to clones encoding the 
M r 19,000 zein proteins. A clone corresponding to the 
Mr 15,000 zein has little homology to either the M T 
22,000 or 19,000 zeins. Clones corresponding to both 
the M r 22,000 and 19,000 zeins have two putative 
polyadenylation signals. SI nuclease mapping indi- 
cates that the first polyadenylation signal following 
the stop codon is utilized by the Af, 22,000 sequences, 
while primarily the second polyadenylation signal is 
utilized by the M r 19,000 sequences. 



To study the expression of zein genes during maize endo- 
sperm development, we constructed and characterized a num- 
ber of full-length cDNA clones (Marks et al., 1986). On the 
basis of cross-hybridization studies, these clones were divided 
into nine distinct groups: one for the M t 15,000 zein, five for 
the Af, 19,000 zeins, and three for the M r 22,000 zeins. No 
clones for the M t 10,000 or reduced soluble protein were 
isolated. However, the clones that were identified appear to 
represent a large proportion of the zein mRNA sequences in 
the endosperm mRNA population (Marks et al, 1985). 

Sequences of clones related to some of those in the groups 
we describe have been reported (Heidecker and Messing, 1983; 
Geraghty et al., 1982; Spena et al., 1982). Previous studies 
revealed that genes encoding the Af f 19,000 and 22,000 zein 
groups are homologous and probably have a common ancestral 
sequence (Marks and Larkins, 1982; Spena et al 9 1982), but 
the M r 15,000 zein sequence was found to belong to an 
unrelated gene family. 1 Genes within the Af, 19,000 and 22,000 
zein groups have diverged from one another through base 
substitutions, small insertions/deletions, and relatively large 
internal duplications (Heidecker and Messing, 1983; Marks 
and Larkins, 1982). Many of these genes contain two func- 
tional polyadenylation signals (Messing et al, 1983), and 
several of the genes appear to have multiple transcriptional 
start sites (Langride and Feix, 1983). 

This study presents a comprehensive comparison of se- 
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quences belonging to eight M t 19,000 and 22,000 zein groups, 
and it presents the sequence of a full-length cDNA clone for 
a Mr 15,000 zein that contains a very long 5' noncoding 
region. We have also analyzed the usage of the polyadenyla- 
tion signals for the M, 19,000 zein genes. 

MATERIALS AND METHODS 

DNA restriction endonucleases, Klenow fragment, T4 kinase, and 
SI nuclease were purchased from Betheeda Research Laboratories. 
l-r**P]ATP, [ft-^PldATP, and [a-^IdCTP were purchased from 
New England Nuclear or Amersham Corp. Nitrocellulose was ob- 
tained from Schleicher and Schuell. Calf intestinal alkaline phospha- 
tase was purchased from Boeh ringer Mannheim. 

5 '-Terminal End Labeling and DNA Sequencing— Restriction frag- 
ments were prepared for sequencing by the radioactive end-labeling 
method of Maxam and Gilbert (1980) for 5' ends as described by 
Pedersen et al (1982). DNA fragments were sequenced by the method 
of Maxam and Gilbert (1980) with the modification of Krayev et aL 
(1980). 

Computer Analysis — Sequences were analyzed using the Map and 
Gap programs developed by Devereux et al (1984). 

SI Nuclease Mapping— Fragments for Si nuclease mapping were 
either 5' end labeled as described by Maxam and Gilbert (1980) or 3' 
end labeled by filling in recessed 3' ends with Klenow fragment as 
described by Maniatis et aL (1982). End-labeled fragments were 
cleaved by restriction enzyme digestion and resolved on a 5% poly- 
acrylamide gel The fragments of interest were excised from the gel 
and isolated by the crush-soak method of Maxam and Gilbert (1980). 

Si nuclease mapping was performed by the method of Berk and 
Sharp (1977) as described by Maniatis et al. (1982). One pg of 
endosperm poly(A) RNA plus 200 *ig of tRNA were added to 3 X 10 4 
to 1 x 10 6 cpm of labeled fragment in 20 \A of 40 mM Pipes 1 (pH 6.4), 
1 mM EDTA (pH 8.0), 0.4 M NaCl, and 80% formamide. Reactions 
were heated to 80 'C for 10 min and then incubated overnight at 
58 *C for Z15 fragments (% G + C = 55%) and at 52 *C for Z19 
fragments {% G + C = 48%). The hybridization reaction was stopped 
by adding 0.38 ml of cold Si nuclease buffer (0.28 M NaCl, 0.05 M 
NaOAc (pH 4.6), 4.5 mM ZnSO*, and 20 ^g/ml denatured calf thymus 
DNA) containing 400 units of Si nuclease. The Si digestion reactions 
were incubated at 37 *C for 30 min after which they were extracted 
with phenol/chloroform and the DNAs precipitated with ethanoL 
Samples were dissolved in sequencing dye (80% formamide, 0.5 x 
TNE (10 mM Tris-HCl, pH 7.4, 15 mM NaCl, and 1 mM EDTA), 
0.1% bromphenol blue, and 0.1% xylene cyanol) and loaded onto a 
6% polyacrylamide sequencing gel that contained 50% urea. 

Northern Blot Analysis —Four /ig of poly(A) RNA from membrane- 
bound polysomes of 18 day -after- pollination endosperms was resolved 
by electrophoresis on an agarose gel that contained methyl mercury 
hydroxide (1.4% agarose, 5 mM methyl mercury hydroxide, 50 mM 
boric acid, 5 mM sodium borate, 10 mM sodium sulfate, and 0.01 mM 
Na»EDTA) (Bailey and Davidson, 1976). The gel was soaked in 0.5 
M ammonium acetate and then stained with ethidium bromide. Gel 
slices that contained the upper and lower zein mRNA bands were 
excised and recast in separate 1.4% agarose gels. The RNA waa 
transferred to nitrocellulose filters (Thomas, 1980), and the nitrocel- 
lulose was prehybridized for 8 h at 42 "C in 50% formamide, 5 X SSC 
(1 x SSC = 0.15 M NaCl, 0.015 M sodium citrate), 0.05 M sodium 
phosphate buffer, pH 6.8, 0.1% bovine serum albumin, 0.1% Ficoll, 
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0.1% polyvinylpyrrolidone, 200 n%/ml heat ^denatured sheared calf 
thymus DNA. The nitrocellulose was then hybridized overnight at 
42 "C in 50% formamide, 5 x SSC, 0.06 M sodium phosphate buffer, 
pH 6.8, 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyr- 
rolidone, 200 Mg/ml sheared calf thymus DNA, 5% dextran sulfate, 
and 7 X 10* cpm of the isolated zein cDNA inserts labeled with **P 
by nick translation (Maniatis et at, 1976). Following hybridization 
the nitrocellulose was washed twice in 1 X SSC containing 0.1% 
sodium dodecyl sulfate at room temperature and twice in 1 X SSC 
containing 0.1% sodium dodecyl sulfate at 68 *C. Filters were air 
dried and autoradiographed 

RESULTS 

By cross-hybridization analysis at a stringent criterion (T„ 
—15 *C), we were able to distinguish nine groups of zein cDNA 
clones (Marks et aL, 1985). The DNA sequences of several 
members from three of these groups was previously reported 
(Marks and Larkins, 1982; Pedersen et aL f 1982). 1 In this 
study seven cDN A clones from the remaining six groups were 
completely sequenced by the method of Maxam and Gilbert 
(Fig. 1). 

Analysis of M r 22,000 Zein Sequences— Each of the clones, 
cZ22A-l and cZ22B-l, encodes a complete zein protein, 
whereas the third, cZ22C-2, lacks a few coding bases on the 
5' end (Fig. 2). Both cZ22B-l and cZ22C-2 contain an entire 
3' noncoding region, and cZ22A-l contains a long 5' noncod- 
ing region. The cZ22B-l and cZ22C-2 sequences align without 
gaps; however, when these clones were compared to cZ22A-l, 
four short gaps were introduced to maximize homology. The 
largest gap results in a 9-base pair deletion in cZ22A-l relative 
to cZ22B-l and cZ22C-2. Overall, the deletion/insertion 
events have left cZ22B-l and cZ22C-2 with three more amino 
acids than cZ22A-l. The clones differ from one another by 6- 
7%, and these nucleotide differences are fairly evenly distrib- 
uted throughout the sequences. The longest stretch of perfect 
homology occurs in the region of the stop codon and is only 
56 nucleotides in length. Forty-four per cent of the base 
changes have resulted in amino acid substitutions. In general, 
the physical properties of the substituted amino acids are 
conserved For example, seven of the 26 amino acid changes 
that occurred during the divergence of cZ22A-l and cZ22B-l 
involved exchanges between valine and alanine (Fig. 3). A few 
substitutions have not been conservative. For example, at 
position 103 (Fig. 3) alanine is replaced with aspartic acid, 




Fig. 1. Restriction enzyme map and sequencing strategy of 
zein cDNA clones. Only selected restriction sites are shown. Hori- 
zontal arrows indicate the strategy followed for the determination of 
DNA sequences. The cDNA clones are: A, cZ22C2; B, cZl9A2; C, 
cZ19BL; D, cZl9Cl; E, cZ19C2; F, cZ19Dl; and G, cZ15A3. Restriction 
enzymes: B, Bali; Bra, BamHl; BnII t Banll; C, Hincll, D, Ddel; /\ 
Hind; N, Ndel; and P, Pstl. 



and at positions 177 and 227 glutamic acid is replaced with 
glutamine and alanine, respectively. One other substitution 
resulted in a tryptophan residue at position 258. This amino 
acid has been found only in zein sequences closely related to 
the cZ22C group (Spena et aL, 1982; Geraghty et ai, 1982). 

Analysis of M, 19,000 Zein Sequences— We identified five 
distinct groups of M T 19,000 zein clones. Members of the four 
groups Z19A, Z19B, Z19C, and Z19D did not cross-hybridize 
at a stringent criterion, whereas those of the fifth group, 
Z19AB, cross-hybridized to members from Z19A and Z19B. 
The representative member of the Z19AB group, gZl9AB-l, 
is a subclone of the genomic clone gZ99, whose sequence was 
previously reported (Pedersen et aL t 1982). The remaining 
groups are represented by cDNA clones, and their sequences 
are compared to that of gZl9AB-l (Fig. 4). 

Sequences from the groups Z19A, Z19B, and Z19AB are the 
most closely related and show 94-95% homology. As was the 
case with the M t 22,000 zein clones, the nucleotide differences 
among these sequences show a fairly random distribution 
(Fig. 4). This is especially true of the cZ19A-2 and cZl9B-l 
sequences; the longest stretch of perfect homology between 
them is less than 50 nucleotides. However, both of these 
clones have regions of up to 70 nucleotides that are identical 
to gZ19AB-l. The differences in the distribution of nucleotide 
changes between clone groups Z19A, Z19B, and Z19AB may 
explain their cross- hybridization behavior. 

To compare members of the same group we determined the 
DNA sequences of cZl9C-l and cZ19C-2 (Fig. 4). These two 
sequences differ by only one nucleotide in the 5' noncoding 
region and by four bases over the first 219 coding nucleotides. 
From that point to the end of the shorter clone, cZ19C-2, the 
sequences are identical. The four base differences lead to 
three conservative amino acid differences (Fig. 5). 

To align the sequences of Z19A, Z19B, Z19AB, and Z19C 
groups, several gaps were introduced (Fig. 4). The largest is a 
24-base pair deletion in the Z19A, Z19B, and Z19AB sequences 
(collectively referred to as Z19A/B) relative to the Z19C 
sequences. The insertion/deletion events that have taken 
place during the divergence of the genes corresponding to 
these clones have rendered the Z19C zeins with six more 
amino acids than the Z19A/B zeins. The Z19C sequences are 
on the average 85% homologous to the Z19A/B groups. At the 
3' end the average of 85% homology is maintained However, 
in the 5' noncoding region these sequences are only 55-60% 
homologous. 

To align cZ19D-l with the other sequences, additional gaps 
had to be introduced (Fig. 4). The longest is a 24-base pair 
deletion in Z19D-1. This sequence has diverged more than 
the others and is only 75% homologous. 

Five of the six clones encode complete zein polypeptides 
(Fig. 5). The remaining clone, cZl9A-2, is missing nucleotides 
for the first three codons. All of the complete polypeptides 
contain a 21 -amino acid signal peptide and an internal tan- 
demly repeated peptide of approximately 20 amino acids that 
has been previously described (Argos et aL, 1982). The poly- 
peptides range in size from 233 to 240 amino acids. As in the 
case of the M, 22,000 zein clones, approximately 45% of the 
nucleotide differences among these clones result in amino 
acid substitutions. Most of the resulting changes are conserv- 
ative; however, at a few positions neutral amino acids have 
been replaced with charged, and vice versa. 

Northern Blot A naiysis— Previous analyses revealed that 
the mRNAs encoding the M t 22,000 zeins can be resolved 
from the mRNAs for the M T 15,000 and 19,000 zeins by gel 
electrophoresis under denaturing conditions (Wienand and 
Feix, 1978; Larkins et aL, 1983). However, the mRNAs for the 
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16453 



Met 

_T •tart! A * 
cZ2S9l fttart: 
cZ2CC2 

CTT TTA OTC AOC OCA ACA AAT OCC TTC ATT ATT CCA CAO TGC TCA CTT GCT CCT ACT OCC AOT ATT CCA CAO TTC CTC CCA CCA CTT ACT ^ 



I C A t A 



ceaU TCAATCOOTTlCC^CATCCAOCCarOC^C^TACAOaCTA 

SSl « 5,1 T A AT AA 

CZ22C2 C CTT * 

cfiBU C«CMCAAtCCmOCACATCT*ACCC«C^ ^ 

Cf22»l AA £ G *** CAO A AT 

CZ22C2 AC 

0*2*1 ATG OTC AAC CCT OTC AOC TAC TTC CAA CAG CAO CTC CTT CCA TCC AAC CCA CTT OCT CTG OCC AAC OTA OCT OCA TAC CAO CAA CAA CAA ^ 

CE22BL 0 A A 

CZ22C2 CO 

ctZZU CAC CTQ CAA CAO TTT ATO CCA CTG CTC ACT CAA CTA OCC ATO OTC AAC CCT OCC CTC TAC CTA *** *** *** CAA CTA CTT TCA TCT AGC ^ 

S C C C T C CAA CAO CAA 

cfiKAl OCC CTC OCC GTO OCC AAT OCA CCT ACC TAC CTA CAA CAA CAO TTC CTQ CAA CAA ATT OTA CCA CCT CTC ACT CAC CTA CCT CTC CCA AAC 

d22»l TC A C GAT G * *™ 



T 



T AC A G 



cf22Al OCT OCT OCC TAC TTA CAA CAO TTG CTT CCA TTC AAC CAA CTQ CCT CTG TCA AAC TCT CCT GCG TAC CTA CAA CAG CCA CAA CAG TTA CTT 

CX22U. TCC AAAGT *<*9 

CX22C2 0 C AC 



<£22A1 AAT CCA TTC OCA GTO OCT AAC CCA TTO OTC OCT ACC TTC CTO CAO CAG CAA CAA CAA TTO CTG CCA TAC AAC CAfl TTC TCT TTO ATC AAC 

SSE OA*** C T VT59 

CC22G2 CAA C A OCA *** A C 

-COOH 

c*22Al cCTOCCTTC^^CAGCAACCCATCOTTOC^OOTaXATCm ATTACAT ATGAGATCTACTCC^CAATOOTOCCCTCATA : end 

d22ftl T TCOAOQ * CCCOC ATCTCTTTCC TA *862 

CZ22C2 TOO TOO T CCQOCATOTOTTTCCTA 



aazcz qaaataaJcaata^ *960 
Fig. 2. Comparison of the nucleotide sequence of clones for the M x 22,000 zeins. Nucleotides are 
numbered starting with the first base of the initiation codon and are indicated on the right-hand margin. The 
sequences of cZ22Al and cZ22Bl (previously named pZ22.1 and pZ22.3, respectively) were previously reported 
(Marks and Larkins, 1982). The complete nucleotide sequence of cZ22Al is given, but only variable nucleotides at 
corresponding positions are listed for the other clones. Asterisks indicate positions where we have introduced gaps 
in the sequence to maximize homology. The positions of the first nucleotides for cZ22B-l and cZ22C-2 follow the 
colons. An indicates the position at which a sequence terminates in a poly(A) tail. Positions corresponding to the 
initiating methionine (Met) and the NH* terminus and COOH terminus of the encoded polypeptide are indicated. 



CZ22AU* HATXllALLA LIALLVSATM AFIIPQCSU PSASIPQFLP PTOHOTHP AVQAYRLQLA LAASALQQPI AQLQQQSLAH LTLOJTIATCJQ QQQQFLPSLS 
C222B1** SFAS SI A Q I V I*A 

c222C2aa Start: f I • L Q V K I * A 



c222AUa HLAMVWVTT LOjOjQUASW LALAWVAATQ QQQQLQQfMP VLSQLAMVKP AVYL***qtL SSSPLAVCHA PTYLQQQLLQ QIVPALTQLA VAKPAATLQQ 
cZ22B1a* A VHUA AQQQ A Z V200 

cZ22C2*a Q DV A LA A QQQ 

-COOH 

cZ2ZAU« UJTWQLAVS KSAAYLQQRQ QLLKPLAVAK PLVATTLOjQQ QQLLPYNQFS U«PAL**OjQ PIVGGAIF 

cZ22U*a TM A * R V SR 2oO 

cZ22C2*# T E f A ♦ S SW 

Fig. 3. Comparison of amino acid sequences deduced from DNA sequences of clones for the M r 
22,000 zeins (Fig. 2). The complete sequence of cZ22Al is given in the standard single letter amino acid code, 
but only variable amino acid residues at corresponding positions are listed for the other sequences. The position 
corresponding to the NH 2 terminus of the mature protein is indicated. Numbering of amino acids begins with the 
first methionine in the signal peptide. Amino acids are divided into groups of 10 and include gapped positions. 
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-45 

CZ19A2 . .... IT AIT i T C 

cZl9ftl STAKTt C C C C 

gZ19All ATCCCACATATTATTCACAtXAA CTA<XJU^T ATM^MJUXA^^^CAATATTCTACCAATA ATG OCA CCC AAA ATA TTT TOC CTC ATT ATC CTC CTT OCT 

CZ19D1 ACATACrXJUUTtACACTACCAATATCTTACCAiOCA ATC OCA CCC AAC ATT TTT GCC CTC CTT CCC CTC CTT OCT 

CZ19C2 AffAACTftACATAiauUM^CCAATCCACT^^ ATC CCC ACC AAC ATA TTT TCC CTC CTT ATC CTC CTT OCT 

cZtKl STA«: C 



♦ 3* 



cZl*A2 C C A A 

cXim C C C T 

ftZltAftl CTTTCTCCAACTCCTCCTAXXCCCAOCAnTTCCCCCAATCCT^ 

cZl9Dl CTT TCA CCA AAC CTT CCT ACC CCC ACT ATT ATT CCA CAA TCC TCA CAA *+* *** *** *** *** *** *** *** CAA TAC CTC TCT CCC CTC 

cZ19<2 CTT TCT ACA TCT CTT CCT AAC CCC ACA ATT TTC CCT CAA TOC TCA CAA CCT CCT ATA CCT TCC CTT CTT CCC CCA TAC CTT CCA TCA ATT 

CZ19C1 C C 



♦ 1M 



cZl»A2 CCC A 

czmi C C CCA T 

(Z19AB1 ATC TCT TCA CTA TCT CAA AAT CCA ATT CTT CTA CCC TAC ACC ATC CAA CAC OCA ATC CCA CCA CCC ATC TTA CCT TTA TCA CCC TTC TTC 

cZl901 ACA CCC CCC ACA TTT CAA TAC CCA ACT ATA CAA TCC TAC ACC CTA CAA CAC CCC ATC CCA CCA ACC ATC TTA CCC TCC TTA OCA TTC ACT *2l* 

CZ19C2 ATA CCT TCA ATA TCT CAA AAC CCA OCT CTT CAA CCA TAT ACC CTT CAA CAA CCA ATC CCA CCA ACC AAC ATA CCT TTA TCC CCC TTC TTC 
cZl9Cl C C C 



cZ19A2 C C T T 

czmi a t 

gZUABl CTC CAA CAA TCA TCA CCC CTA TTA CAA CAC TTA CCT TTC CTC CAT TTC TTC OCA CAA AAC ATC ACC OCA CAA CAA CTA CAA CAA CTC CTC 

CZI9D1 CTC CAA CAA CCA TAT CCC CTA TTC CAA CAA CCA TCC TTA CTC AAT CTA TAT CTC CAA ACA ATC CTA OCA CAA CAA CTA CAA CAA CAA TTC *309 

CZ19C2 TTT CAA CAA TCC CCA CCC CTA ************ TCT TTC CTC CAC TCA TTC CTA CAA ACC ATC ACC CCA CAA CAC CTC CAC CAA CTC CTC 

cZ19Cl 



CZ19A2 

czmi 

gZl9ABl CTA *** *** *** *** 
cZlftH CTT CCA ACA ATC AAT 

CZ19C2 CTA CCT CTC ATC AAC 

CZ19C1 



C A 

**+ *** ****** OCA AAC CTT CCT CCC 
CAA CTA CTT CCA CCC AAC CTT CAT CCT 
CAA CTA CCT CTC CCA AAC CTT TCT CCC 



T T 
C T 
TAC TCT CAC CAA CAA CAC TTT CTC 
TAC CTC CAC CAA CAA CAA TTT CTT 
TAC TCT CAC CAA CAA CAA TTT CTT 



C 

C T 

CCA TTC AAC CAA CTA CCT CCA TTC 

CCA TTC AAT CAA CTA CCT CCC CTC *399 

CCA TTjC AAC CAA CTC TCT ACA CTC 



CZ19A2 ****** A 

czmi T A *** A 

gZ19ABl AAC TCT CCT CCT TAT TTC CAC CAA CAA CAA CTA CTA CCA TTC 

CZ19D1 AAC CCT CCT CCT TAC TTC CAC CCA CAA CAG CTA CTA CCA TTC 

CZ19C2 AAC CCT CCT GCT TAT TTG CAC *** CAA CAA CTA TTA CCA TTT 
cZ19Cl 



A 

C A 
ACC CAC CTA *** ****** CCT CCT CCC TAC CCC CCC *** *** CAA TTT 
AAC CAA CTT CTC AGO ACC CCT GCT CCC TTC TTA CTC CAG CAA CAC TTC +4*9 
ACC CAG CTA *** *** *** CCT ACT CCC TAC TCT CAC CAA CAA CAA CTT 



CZ19A2 T 
CZ19B1 

gZ19AM CTT CCA TTC AAC CAA 

CZ19D1 TTC CCA TTC CAT CTA 

CZ19C2 CTT CCA TTT AAC CAA 
CZ19C1 



CC 
C 

CTC CCA OCA TTC AAC TCT CAT CCT TAT 
CAA CTT CTC OCA AAC ATT CCT CCT TTC 

TTC CCC CCA CTC AAC CCC CCT CCT TAT 



T CCC 

T C 

CTA CAA CAA CAA CAA CTA CTA CCA 

TTC CAA CAA CAA CAA TTC CTC CCA 

TTC CAC CAC CAA ATA CTA CTA CCA 



A 

C 

TTC ACC CAC CTA CCT CCT CTC ACC 

TTT TAC CCA CAC CTT CTC CCA AAC *579 

TTT ACC CAC CTA CCT CCA CCA AAC 



CZ19A2 

CZ19B1 A 

gZl*ABl CCT CCT CCC TTC TTC 

CZ19D1 ATT AAC CCC TTC TTC 

CZ19C2 CCT CCT TCC TTC TTC 
CZ19C1 



A 

A C 

ACA CAC CAA CAC TTC TTG CCC TTC TAC 
CAA CAC CAA CAC TTC CTC CCA TTC TAC 
ACA CAC CAA CAC TTC CTC CCT TTC TAC 



C C 
A CT C 

CTC CAC ACT CCC CCT AAC CTT CCC 
CCA CAC CAT CTC CCA AAC AAT CTC 
CAC CAC TTT CCC CCT AAC CCC CCA 



C 

ACC CTC TTA CAA CTC CAA CAA TTC 

CCC TTC TTA CAA CAA CAA CAA TTC +669 

ACC CTC TTA CAA CTA CAA CAA TTC 



-CCOB 

CZ19A2 * 
cZ19U A T 

ftZ19ABl CTC CCA TTC CAC CAA CTT CCT TTG ACA AAC CCA CCA CCC TTC TAC CAA CAA CCC ATC ATT GCT CCT CCC CTC TTT TAC 

CZ19D1 CTC CCA TTT ACC CAA CTT OCT TTC ACC AAT CCT ACC ACC TTA TTC CAC CAC CCC ACC ATT GCT CCT CCC ATC TTC TAC 

cZ!9C2 TTCCCCTTrCTCCAAOTCCTrrCACACACCCACCCCCCT^ 
C219C1 



cX19A2 

czmi 

*X19AB1 

CZ19D1 
cZ19C2 
CZ19C1 



C CAC An:eo* 

T AT C A*:««d 

ATTCCTTATCACrrATACTTCAATAATAAAV 1 1 H U 11X tlA TATTTCTCCCTTCCCACAAAT^ • • • 

ATTTTITATCCTTTATACTCTAATAATAA^ : 

ATTCCTrATTACTTCTAATTCAATAATAAACT^ • •"<" 

ACATTCTAATCTCATATTCCT 

Fig. 4. Comparison of the nucleotide sequence of clones for the M r 19,000 zeins. The nucleotide 
sequence of gZ!9ABl (previously named ZG99) was previously reported (Pedersen et al, 1982). The complete 
sequences of gZ19ABl, cZl9Dl, and CZ19C2 are given. The sequences of the cZ19A2 and cZ19B1 follow the colons, 
and only nucleotides in the sequences that vary from gZl9ABl are shown. Likewise, the cZ19C1 sequence begins 
after the colon and only nucleotides in that sequence which vary from CZ19C2 are shown. An indicates a sequence 
terminating in poly{A) tail. Asterisks indicate positions where we have introduced gaps in the sequences to 
maximize homology. The putative polyadenylation signals are underlined. Positions corresponding to the initiating 
methionine (Met) and the NH, terminus and COOH terminus of the encoded polypeptide are indicated 



Nucleotide Sequence of Zein mRNAs 



16455 



cZ19Dl 
C219C1 
CZ19C2 



ART : FL T T V Q * 

L T S V Q 

HAAKIFCLIN LLGLSASAAT ASIFPQCSQA PIASLLPPYL SPAKSSVCEH PILLPYHIQQ AlAACILPLS PLFLQQSSAL LQQLPLVHLL AQNIRAQQLQ 

KAAKIFALLA LLALSANVAT ATIIPQCSQ* QYL SPVTAARfTTf PTIQSTRLQE AIAASILRSL ALTVQQPYAL LQQPSLVKLY LQRIVAQQLQ 

KATKirSLLM LLALSACYAW ATIFPQCSQA PIASLLPPYL PSKlASVCEN PALQPYBLQQ AIAASNIPLS PLLFQQSPAL ••••SLVQSL VQTIRAQQLQ 
T I I 



cil9A2 
CZ19B1 



GK * • Q Q ALPD 

c^hdx GSS*<4 PQ PL GT 

CZ19AB1 QLVL ANLAAYSQ QQQFLP7NQL AALMSAAYLO, QQQLLPFSQL ♦••AAAYPR* •QFLPFWO.LA AUtSHAYVQQ QQLLPFSQLA AVSPAAFLTQ 

ctl9Dl QQLLPTIWQV VAANLDAYLQ QQQFTJPFNQL AGVNPAAYLQ AQQLLPTNQL VRSPAATLLQ QQLLPTHLQV VANIAATLQQ QQLLPFTPQV VCNINATLQQ ?00 
CZ19C1 QLVLPLINQV ALANLSPYSQ QQQFLPmjL STLKPAAYLQ •QQLLPTSQL • * *ATAYSQQ QQLLPTNQLA ALKPAAYLQQ QILLPFSQLA AAMRASFLTQ 



CZ19C2 



CZ19A2 

cZ19Bl 

CZ19AB1 

CZ1901 

cZ19Cl 



AAV If T 

P Q V A If V 

QQLLPmHT APNVGTLLQL QQLLPFtXjLA LTtfPAAFYQQ PITGGALF 

QQLLPrrPQD VAKHVAFLQQ QQLLPTSQLA LTHPTTLLQQ PTIGGAIT 

QQLLPTYQQF AANPATLLQL QQLLPTVQLA LTDPAASYQQ HIICGALF 

FiG. 5. Comparison of amino acid sequences deduced from the DNA sequence of clones for the M r 
19,000 zeins (Fig. 4). The complete amino acid sequences deduced from gZl9ABl, cZ19Dl, and cZl90l are 
given. Only amino acid residues for cZl9A2 and cZ!9Bl that differ from gZl9ABl are shown as are only amino 
acid residues for CZ19C2 that differ from the sequence of cZldCl. Asterisks indicate gaps in the amino acid 
sequence. The positions of the NH 2 terminus and COOH terminus of the mature proteins are indicated. The 
number of amino acids begins with the first methionine in the signal peptide. Amino acids are divided inio groups 
of 10 and include gapped positions. 
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Fig. 6. Northern blot analysis of zein mRNAs. A, 18 DAP 

endosperm poly(A) RNA from membrane -bound polyribosomes was 
resolved on 1.4% agarose gels that contained 5 mM methyl mercury. 
RNA from gel slices that contained the top {T) and bottom {B) zein 
mRNA bands was transferred to nitrocellulose and hybridized to 
nick- translated zein cDNA inserts as described under "Materials and 
Methods." The cDNA probes were: B, cZl5A3; C, cZ19Cl; D, cZ22Bl; 
and £, gZ19ABl. 

M t 15.000 and 19,000 zeins have not been resolved. We used 
a modified Northern blot analysis to determine if our clone 
groups correspond to distinct mRNA populations encoding 
the M F 22,000, 15,000, or 19,000 zeins. 

Poly(A) rRNA was isolated from membrane-bound poly- 
somes of maize endosperm and resolved on a methyl mercury 
agarose gel (Fig. 6A). The upper and lower bands correspond- 
ing to zein mRNAs were excised, blotted onto separate nitro- 
cellulose filters, and hybridized to representative clones. 
Probes representing the Z15 and Z19C groups hybridized only 
to the lower band (Fig. 6, B and C). The cZ22B-l probe 
hybridized only to the upper band under conditions that 
allowed cross -hybridization between the three closely related 
M T 22,000 zein groups (Fig. 6D). These results indicate that 
the mRNA populations corresponding to the respective clone 
groups are homogenous and probably encode only a single 
size class of protein. The 19AB probe hybridized to both the 
upper and lower bands (Fig. 6f£), a result which is consistent 
with the existence of two subsets of mRNAs corresponding to 
this group (Hu et aL, 1982; Heidecker and Messing, 1983). 
Presumably, one group of mRNAs encodes M t 19,000 zeins 
and the other encodes M T 22,000 zeins. 

Analysis of a M T 15,000 Zein cDNA Clone—We recently 
compared the sequence of a cDNA clone and a genomic clone 
encoding a M, 15,000 zein. 1 The cDNA clone lacks a portion 
of the 5' coding region but contains the entire 3' noncoding 
sequence of its template mRNA. Except for one nucleotide 



difference, the cDNA sequence is identical to the genomic 
sequence for its entire length. 

We report here the sequence of an additional cDNA clone 
that encodes a complete M r 15,000 zein polypeptide (Fig. 7). 
This sequence differs from the genomic clone 1 by a few 
nucleotides. Since we estimated that there may only be two 
genes for the M r 15,000 zeins (Wilson and Larkins, 1984), 
these two sequences may represent them. However, some of 
the base differences between the clones may be due to se- 
quencing artifacts. Because of the high G + C content, many 
regions of the sequencing gels were compressed, making the 
sequence determination extremely difficult. Essentially all of 
the base differences were in these regions. 

Previous Si- mapping data suggested that the 5' end of the 
mRNA for the M r 15,000 gene began 65-70 nucleotides before 
the start codon (see Fig. 7). However, this larger cDNA clone 
has over 150 noncoding nucleotides at the 5' end that align 
perfectly with the 5' noncoding sequence of the genomic clone. 
The existence of this cDNA clone indicates that transcription 
may originate from two regions of the gene. To investigate 
this, we performed an Si -mapping analysis. A HindlU-Ddel 
fragment that contains the 5' end of the mRNA sequence was 
hybridized to poly (A) RNA isolated from staged endosperms 
(Fig. 8A) and the hybrids digested with Si nuclease. The 
majority of protected fragments ended in the same region 
previously shown to be the 5' end of the mRNA. However, 
upon longer exposure of the gel we detected some transcripts 
that protected the complete noncoding region of the longer 
cDNA clone (Fig. 8B). While these results are consistent with 
the possibility of a major and minor origin of transcription, 
they do not exclude processing of a larger transcript for this 
gene. The longer transcript appears to be more abundant at 
early stages of endosperm development, as is indicated by the 
greater protection of the larger fragment with poly(A) RNA 
from 12 DAP endosperms than from later stages (Fig. 86). 
Further analysis of the transcriptional activity of the M t 
15,000 gene will be presented elsewhere: 1 

The last 23 nucleotides of the 3' end of the cZl5A3 were 
not homologous to the genomic clone. The terminal 28 nu- 
cleotides were found to be an inverted copy of the sequence 

x R. S. Boston and B. A. Larkins, manuscript in preparation. 
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AATCCATCTACTATCTTTCCACGTCATG^ 



- 39 



GCGGCACTCCCT AOAGACCATCCTCCAACC AGAA CAGC + ATG AAC ATC CTC ATC GTT CTC GTC CTG TGG CTG GCT CTG TCA GCT GCC AGC GCC TCT CCA 



60 



ATC CAG ATC CCC TGC CCC TGC GCC GGG CTG CAC GGC TTC TAC OCC GCT CGC GCC GGC CTG ACG ACG ATC ATG GGC GCC CGC GGC CTG TAC 
MOMPCPCAGLQCLYCAGAGLTTHHCAGGLX 



♦ 150 



CZ15A3 



CCC TAC GCC CAC TAC CTC AGO CAG CCC CAC TGC AGC CCG CTG GCC CCC GCC CCC TAC TAC GCC GGG TCT GGG CAG ACG AGC GCC ATC TAC *2*0 
PYAEYLRQPQCSPLAAAPYTACCCQTSAHI 



CZ15A3 



CAC CCC CTC CGC CAA CAG TGC TGC CAG CAG CAC ATG AGG ATG ATC CAC CTG CAG TCC CTC GCC CAG CAG CTG CAG ATG ATG ATC CAC CTT *330 
nPT ROQCCOQGNRMMDVQSVAQQLQMMMgi 



CZ15A3 



CAC OCT GCC GCT ACC GCC AGC AGC AGC CTC TAC GAG CCA GCT CTG ATG CAG CAG CAC CAG CAG CTG CTG CCA GCC CAA GGT CTC AAC CCC 
EFAATASSSLYEPALMQQQQQLLAAQGL»P 



CZ15A3 



ATG GCC ATG ATG ATC GCC CAG AAC ATG CCG GCC ATG GGT GGA CTC TAC CAG TAC CAG TAC CAG CTG CCC AGC TAC CGC ACC AAC CCC TOT 
HANHHAQNHPAMGCLTQIfQYQLeSYRTSPC 



♦510 



c215A3 



GGG GTC TCC GCT GCC ATT CCC CCC TAC TAC TGA 



TTCATGATATTTC<&AAATCTCCTCTATCt^ 



I 



I 



♦617 



CZ15A3 WWAWAfGTtat^ATGACCAATAATATATG^ 

FiG. 7. Nucleotide sequence of a cDNA clone encoding a Af r 15,000 zein protein. The predicted amino 
acid sequence is shown below the nucleotide sequence. XXX indicates the major transcriptional start site, and the 
underscored sequence corresponds to the putative polyadenylation signal The overscored region indicates the 
position of an inverted nontandem repeated sequence. The NH r terminal and COOH-terrmnal positions of the 
mature polypeptide are indicated. 
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Fk;. 8. SI mapping of the 5' region of cZ15A3. A, a partial restriction enzyme map of the 5' end of cZl5A3 
and a region of the clone used in the Sl-mapping experiment. C lA indicates a G tract of 14 bases that was added 
daring the cloning procedure. The restriction enzyme sites upstream from the G tract are in the poly linker region 
of the plasmid vector pUC8. The probe was generated by end labeling a Ddel fragment with subsequent cleavage 
at the tfmdIII site in the poly linker. The vertical arrows indicate the limits of Si digestion as shown in B. B, 
Maxam and Gilbert sequencing reactions of the probe are shown in lane 1 (G reaction), lane 2 (G + A reaction), 
lane 'A (C + T reaction), and lane 4 fC reaction). The probe was hybridized to 1 fig of poly(A) RNA from 12 DAP 
(lane .51, 18 DAF (lane 6), 22 DAP {Uxne 7). and 28 DAP (lane 8) endosperms as described under "Materials and 
Methods." After the hybridization the probe was digested with Si nuclease and resolved on a sequencing gel. The 
sequence in the region'of the nuclease -protected bands and the location of the bands (arrows) are given on the left- 
hand margin. Restriction enzyme key: D, Dde\\ tfmdlll; N, Nde I; and /\ Pstl 
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Fig. 9. Mapping of the 3' region of cZ15A3. A, a partial restriction enzyme map of the 3'-terminal of 
cZ15A3 and the region of the clone used in the Si -mapping experiment. The C ia represents a C tract of 13 bases 
that was added during the cloning procedure (Marks et a£, 1986). The restriction enzyme sites downstream from 
the C tract are in the poly linker region of the plasmid vector, pUCo. The probe was generated by cleavage with 
hind, and it was 3' end labeled by using Klenow fragment. The labeled fragment was recut with HincU. The 
horizontal arrows show the position of inverted repeats (see Fig. 6), and the vertical arrows indicate the limit of Si 
digestion as shown in B. B, Maram and Gilbert sequencing reactions of the probe are shown in the first two lanes 
as indicated. The probe was hybridized to 1 fig of 18 DAP endosperm poly(A) RNA plus 200 M g tRNA (+RNA) or 
200 M g of tRNA without poly(A) RNA (-RNA) as described under "Materials and Methods." After hybridization 
the probe was digested with Si nuclease and resolved on a sequencing gel. The sequence in the region of the 
nuclease-protected bands and the location of the bands {arrows) are given at the left-hand margin. Restriction 
enzymes: B, Ball; C, Hincll; //, Hind; and P t Pst\. 



Fig. 10. Determination of the 3' 
terminus of mRNAs closely related 
to the Z19A/B clones. A, a partial re- 
striction enzyme map of the 3' coding 
and noncoding sequence of the clone 
gZl9AB-l (also called ZG99, Pedersen et 
at, 1982) and the region of the clone 
used in a Si -mapping experiment. The 
probe was made by cleavage with Ban I 
followed by 3' end labeling with Klenow 
fragment and a second restriction diges- 
tion with Hincll. The vertical arrows 
indicate the limits of Si digestion as 
shown in B. B, Maxam and Gilbert se- 
quencing reactions of the probe are 
shown in lane J (G + A reaction) and 
lane 2 (C + T reaction). The probe was 
hybridized to 1 /zg of 18 DAP endosperm 
poly(A) RNA plus 200 ^g of tRNA (lane 
3) and to 200 u% of tRNA alone {lane 4) 
as described under "Materials and Meth- 
ods."* After hybridization the probe was 
digested with Si nuclease and resolved 
on a squencing gel. The sequences in the 
region of the nuclease-protected bands 
and the location of the bands (arrows) 
are given on the left-hand margin of B. 
Restriction enzymes: Bnl, Ban\ and H, 
Hindi 
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found between nucleotides 614 and 642 (Fig. 7). To determine 
if the 3' end of the cDNA is authentic, we used a Hinfl to 
Hincll fragment 3' end labeled at the Hinfl site to probe 
endosperm poly (A) RNA (Fig. 9A). The only protected frag- 
ments extended a few bases into the terminal inverted repeat 
(Fig. 9B). Thus, this inverted repeat is apparently a cloning 
artifact and is not due to sequence heterogeneity. 

Analysis of Polyadenylation— Many of the M r 19,000 zein 
genes were found to contain two polyadenylation signals 
(Messing et aL, 1983). To quantitate the relative utilization 



of these signals in the genes belonging to the Z19A/B groups 
(Fig. 4) we did an Si -mapping experiment. A Banl to HincU 
fragment from the genomic clone 19AB-1 was 3' end labeled 
on the Baal site (Fig. 10A) and hybridized to endosperm 
poly(A) RNA (Fig. 10B). The probe should be protected by 
mRNA corresponding to the Z19A/B groups since these se- 
quences differ by only a few nucleotides in this region. Two 
major fragments that differ in size by seven nucleotides were 
protected (Fig. 10B). Both of these fragments extend past the 
second polyadenylation signal. Furthermore, the shorter and 
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longer fragments end precisely at the last nucleotides found 
in cZ19A-2 and cZl9B-l, respectively (Fig- 4). Several addi- 
tional bands, fainter in intensity, correspond to fragments 
ending three bases after the first polyadenylation signal. Since 
addition of the poly(A) tail normally occurs within 10-30 
nucleotides of the polyadenylation sequence (Nevins, 1983), 
these may not represent authentic 3' ends of the transcripts. 
They may instead represent a region of instability due to a 
long tract of Ts in this area. A few faint bands corresponding 
to mRNAs terminating 10-40 nucleotides beyond the first 
polyadenylation signal also appeared 

Heidecker and Messing (1983) have sequenced two cDNAs 
closely related to 19A/B groups that terminate after the first 
polyadenylation signal. Thus, the first site is probably used, 
but at a much lower frequency than the second Similar results 
were obtained with probes from the 19C and 19D groups (data 
not shown). 

DISCUSSION 

To study the heterogeneity among the zein mRNAs in the 
inbred W64A, we characterized the structure of a number of 
zein cDNA clones. Each of the three groups of M t 22,000 rein 
sequences is 60-70% homologous to the Af r 19,000 zein se- 
quences. Sequences among the Ai, 22,000 groups showed 92% 
homology, and sequences among the Mr 19,000 groups have 
homology ranging from 75 to 95%. The relationships among 
all these zein sequences are illustrated in Fig. 11, which 
suggests the order of duplications that gave rise to these 
groups. 

Hu et aL (1982) characterized a genomic clone, Z4, closely 
related to our Z19A/B groups that contains a large internal 
duplication. The duplication adds 30 amino acids to the 
encoded polypeptide and makes it the size of a M t 22,000 zein. 
By Northern blot analysis we found that in the W64A inbred 
this type of duplication appears to be restricted to genes 
within the Z19A/B group (Fig. 6). 

The duplication event suggests that the M r 22,000 zeins 
may have originated from a M t 19,000 gene, and close exam- 
ination of the M t 22,000 sequences provides evidence for this. 
The sequence from nucleotides 264-396 is 85% homologous 
to the sequence from nucleotides 397-540. The duplication 
that formed this region would have added approximately 40 
amino acids to the encoded polypeptide. A series of base 
substitutions would account for the subsequent diversion of 
the genes into two distinct gene families. 




FIG, 11. Comparison of sequence homology among zein 
mRNAs. The phytogeny was reconstructed with the method de- 
scribed by Fitch and Margoliash (1967) from the approximate per 
cent of nucleotide substitutions that have taken place among the zein 
genes. The percentages listed are the approximate per cent of nucleo- 
tide substitution between putative gene branching points. 



We have also sequenced a cDNA clone for the Af r 15,000 
zein which is nearly identical to that obtained for a genomic 
clone encoding this protein. 1 The sequences of the Af r 15,000 
zeins show no homology to those for either the A# r 19,000 or 
22,000 zeins, indicating that they comprise a separate family 
of genes. The Af f 15,000 zeins have a much higher G + C 
content, lack an internal repeating nucleotide sequence, and 
are present in only one or two copies in the genome (Wilson 
and Larkins, 1984). According to SI mapping, the mRNA 
encoding the M r 15,000 zein primarily begins 65-70 nucleo- 
tides before the start codon (Fig. 8). The sequence of cZl5A3 
differs from this by having a 5' noncoding sequence of over 
150 nucleotides. Based on Si mapping we estimate that the 
longer mRNA is 100-1000-fold less abundant than the major 
mRNA species (Fig. 8). Typical "TATA** boxes (Efotratiadis 
et aL, 1980) are found in the genomic clone just upstream 
from the start of both mRNA species suggesting that both 
start sites may be used. But it is also possible that the larger 
transcript is a precursor of the smaller one. The existence of 
zein genes with two transcriptional promoters is not without 
precedence. Langride and Feix (1983) analyzed a gene that 
encodes a M t 22,000 zein that appears to have two promoters. 
However, these promoters are separated by over 1000 nucleo- 
tides, and they have nearly equal levels of transcriptional 
activity. 

Of the five zein cDNA clones that contained poly(A) tails 
only three had the consensus sequence W AATAAA" which is 
found on the 3' terminus of many animal genes (Nevins, 
1983). This sequence was located 50-100 nucleotides before 
the start of the poly(A) tail, whereas in animal genes the 
polyadenylation signal is generally 10-30 nucleotides up- 
stream from the poiy(A) tail. Messing et aL (1983) identified 
several putative variants of the polyadenylation signal in zein 
cDNA sequences that were located closer to the poly(A) tail. 
In the A20 and A30 clones, which are closely related to our 
Z19C and C19A/B groups, respectively, they found the se- 
quence "AATAAG." We also found this sequence on the 3' 
terminus of the Z19C and Z19A/B clones (Fig. 4). In their 
B49 sequence (closely related to the Z22C group) another 
putative variant, "AATAAT," was found (Geraghty et aL, 
1982). This sequence is also found on the 3' terminus of our 
Z22B and Z22C clones (Fig, 2). 

In addition to the variation in the sequence of the poly- 
adenylation signal, some plant genes also differ from animal 
genes by having two potential polyadenylation sites (Messing 
et aL, 1983). Hu et aL (1982) reported the sequence of a 
genomic clone, Z4, that has a "AATAAA" sequence and a 
"AATAAG" sequence 26 and 65 bases, respectively, down- 
stream from the stop codon. They also isolated cDNA clones 
that terminated in a poly (A) tail after each of these sequences 
(Heidecker and Messing, 1983). These clones are closely re- 
lated to our cZl9A2 and cZl9Bl clones which end in a poly(A) 
tail after the second putative polyadenylation signal. 

Our results indicated that most of the zein mRNA that 
corresponds to the Z19A/B clones terminated after the second 
polyadenylation sequence. This appears to be in contrast to 
mRNAs closely related to the Af r 22,000 clone groups (Z22A, 
B, C). Three of four of the cDNA clones which belong to these 
groups represented mRNAs that were polyadenylated after 
the first polyadenylation sequence (Geraghty et a/., 1982; 
Spenaef aL, 1982). The remaining clone, cZ22C-2, represented 
an mRNA that was polyadenylated after a second polyadenyl- 
ation site (Fig. 2). Interestingly, the sequence that Spena et 
aL (1982) determined for the cDNA clone, zal, is identical to 
that of cZ22C2 up to the point at which zal ends in a pory(A) 
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tail. Thus, it appears as though there are preferred polyaden- 
ylation sites, but their utilization is variable. 
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